Poplar exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

BowStrap v1.0: Assigning statistical significance to expressed genes using short-read transcriptome data.

Internal identifier: 002B78 (Main/Exploration); previous: 002B77; next: 002B79

Authors: Peter E. Larsen [United States]; Frank R. Collart

Source: BMC Research Notes, 2012

RBID : pubmed:22676709

Abstract

BACKGROUND

Deep RNA sequencing, the application of Next Generation sequencing technology to generate a comprehensive profile of the messenger RNA present in a set of biological samples, provides unprecedented resolution into the molecular foundations of biological processes. By aligning short-read RNA sequence data to a set of gene models, expression patterns for all of the genes and gene variants in a biological sample can be calculated. However, accurate determination of gene model expression from deep RNA sequencing is hindered by the presence of ambiguously aligning short-read sequences.

FINDINGS

BowStrap, a program that applies the sequence alignment tool 'Bowtie' in a bootstrap-style approach, accommodates multiply-aligning short-read sequences and reports gene model expression as the average number of aligned reads per kb of gene model sequence per million aligned deep RNA sequence reads, together with a confidence interval suitable for calculating the statistical significance of the presence or absence of detected gene model expression. BowStrap v1.0 was validated against a simulated metatranscriptome. Results were compared with two alternative 'Bowtie'-based calculations of gene model expression. BowStrap identifies expressed gene models in a dataset more accurately and provides a more accurate estimate of gene model expression level than methods that do not incorporate a bootstrap-style approach.

CONCLUSIONS

BowStrap v1.0 is superior to other alignment-based methods of determining patterns of gene expression, both in detecting significant gene model expression and in accurately determining gene model expression levels. BowStrap v1.0 can also use multiple processors and has a decreased run time compared to the previous version, BowStrap 0.5. We anticipate that BowStrap will be a highly useful addition to the available set of Next Generation RNA sequence analysis tools.
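
The abstract does not spell out the resampling procedure, so the following is only a minimal Python sketch of the general idea, not BowStrap's actual implementation. It assumes that, in each replicate, every multiply-aligning read is assigned at random to one of the gene models it aligns to, that expression is then computed as reads per kb of gene model per million aligned reads, and that a percentile interval over the replicates serves as the confidence interval. The function names (`rpkm`, `bootstrap_expression`, `is_expressed`), the input layout, and the default parameters are all hypothetical.

```python
import random
from statistics import mean


def rpkm(read_count, gene_length_bp, total_aligned_reads):
    """Reads per kb of gene model sequence per million aligned reads."""
    return read_count / (gene_length_bp / 1_000) / (total_aligned_reads / 1_000_000)


def bootstrap_expression(read_hits, gene_lengths, n_replicates=200, alpha=0.05, seed=0):
    """Estimate expression with a percentile confidence interval.

    read_hits    -- one entry per aligned read: the list of gene ids it aligns to
                    (assumed input layout, not BowStrap's file format)
    gene_lengths -- {gene id: gene model length in bp}
    Returns {gene id: (mean_rpkm, ci_low, ci_high)}.
    """
    rng = random.Random(seed)
    total_reads = len(read_hits)
    samples = {gene: [] for gene in gene_lengths}

    for _ in range(n_replicates):
        counts = dict.fromkeys(gene_lengths, 0)
        # Assumption: an ambiguous read goes to one candidate chosen at random.
        for hits in read_hits:
            counts[rng.choice(hits)] += 1
        for gene, length in gene_lengths.items():
            samples[gene].append(rpkm(counts[gene], length, total_reads))

    result = {}
    for gene, values in samples.items():
        values.sort()
        low = values[int((alpha / 2) * (n_replicates - 1))]
        high = values[int((1 - alpha / 2) * (n_replicates - 1))]
        result[gene] = (mean(values), low, high)
    return result


def is_expressed(interval):
    """Call a gene model expressed when the lower interval bound is above zero."""
    return interval[1] > 0
```

For example, calling `bootstrap_expression` with 60 reads that align only to a gene `"geneA"` and 40 reads that align to both `"geneA"` and `"geneB"` yields an interval for each gene model, and a gene model that never receives a read gets the interval `(0.0, 0.0, 0.0)` and is reported as not expressed.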


DOI: 10.1186/1756-0500-5-275
PubMed: 22676709
PubMed Central: PMC3494516


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">BowStrap v1.0: Assigning statistical significance to expressed genes using short-read transcriptome data.</title>
<author>
<name sortKey="Larsen, Peter E" sort="Larsen, Peter E" uniqKey="Larsen P" first="Peter E" last="Larsen">Peter E. Larsen</name>
<affiliation wicri:level="1">
<nlm:affiliation>Biosciences Division, Argonne National Laboratory, Lemont, IL, 60490, USA. plarsen@anl.gov</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Biosciences Division, Argonne National Laboratory, Lemont, IL, 60490</wicri:regionArea>
<wicri:noRegion>60490</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Collart, Frank R" sort="Collart, Frank R" uniqKey="Collart F" first="Frank R" last="Collart">Frank R. Collart</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2012">2012</date>
<idno type="RBID">pubmed:22676709</idno>
<idno type="pmid">22676709</idno>
<idno type="doi">10.1186/1756-0500-5-275</idno>
<idno type="pmc">PMC3494516</idno>
<idno type="wicri:Area/Main/Corpus">002A07</idno>
<idno type="wicri:explorRef" wicri:stream="Main" wicri:step="Corpus" wicri:corpus="PubMed">002A07</idno>
<idno type="wicri:Area/Main/Curation">002A07</idno>
<idno type="wicri:explorRef" wicri:stream="Main" wicri:step="Curation">002A07</idno>
<idno type="wicri:Area/Main/Exploration">002A07</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">BowStrap v1.0: Assigning statistical significance to expressed genes using short-read transcriptome data.</title>
<author>
<name sortKey="Larsen, Peter E" sort="Larsen, Peter E" uniqKey="Larsen P" first="Peter E" last="Larsen">Peter E. Larsen</name>
<affiliation wicri:level="1">
<nlm:affiliation>Biosciences Division, Argonne National Laboratory, Lemont, IL, 60490, USA. plarsen@anl.gov</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Biosciences Division, Argonne National Laboratory, Lemont, IL, 60490</wicri:regionArea>
<wicri:noRegion>60490</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Collart, Frank R" sort="Collart, Frank R" uniqKey="Collart F" first="Frank R" last="Collart">Frank R. Collart</name>
</author>
</analytic>
<series>
<title level="j">BMC research notes</title>
<idno type="eISSN">1756-0500</idno>
<imprint>
<date when="2012" type="published">2012</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithms (MeSH)</term>
<term>Databases, Nucleic Acid (MeSH)</term>
<term>Gene Expression (MeSH)</term>
<term>Gene Expression Profiling (methods)</term>
<term>Gene Expression Profiling (statistics &amp; numerical data)</term>
<term>Genes, Synthetic (MeSH)</term>
<term>High-Throughput Nucleotide Sequencing (MeSH)</term>
<term>Laccaria (genetics)</term>
<term>Populus (genetics)</term>
<term>Sequence Alignment (MeSH)</term>
<term>Sequence Analysis, RNA (methods)</term>
<term>Sequence Analysis, RNA (statistics &amp; numerical data)</term>
<term>Software (MeSH)</term>
<term>Transcriptome (MeSH)</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr">
<term>Algorithmes (MeSH)</term>
<term>Alignement de séquences (MeSH)</term>
<term>Analyse de profil d'expression de gènes (méthodes)</term>
<term>Analyse de profil d'expression de gènes (statistiques et données numériques)</term>
<term>Analyse de séquence d'ARN (méthodes)</term>
<term>Analyse de séquence d'ARN (statistiques et données numériques)</term>
<term>Bases de données d'acides nucléiques (MeSH)</term>
<term>Expression des gènes (MeSH)</term>
<term>Gènes de synthèse (MeSH)</term>
<term>Laccaria (génétique)</term>
<term>Logiciel (MeSH)</term>
<term>Populus (génétique)</term>
<term>Séquençage nucléotidique à haut débit (MeSH)</term>
<term>Transcriptome (MeSH)</term>
</keywords>
<keywords scheme="MESH" qualifier="genetics" xml:lang="en">
<term>Laccaria</term>
<term>Populus</term>
</keywords>
<keywords scheme="MESH" qualifier="génétique" xml:lang="fr">
<term>Laccaria</term>
<term>Populus</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>Gene Expression Profiling</term>
<term>Sequence Analysis, RNA</term>
</keywords>
<keywords scheme="MESH" qualifier="méthodes" xml:lang="fr">
<term>Analyse de profil d'expression de gènes</term>
<term>Analyse de séquence d'ARN</term>
</keywords>
<keywords scheme="MESH" qualifier="statistics &amp; numerical data" xml:lang="en">
<term>Gene Expression Profiling</term>
<term>Sequence Analysis, RNA</term>
</keywords>
<keywords scheme="MESH" qualifier="statistiques et données numériques" xml:lang="fr">
<term>Analyse de profil d'expression de gènes</term>
<term>Analyse de séquence d'ARN</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Algorithms</term>
<term>Databases, Nucleic Acid</term>
<term>Gene Expression</term>
<term>Genes, Synthetic</term>
<term>High-Throughput Nucleotide Sequencing</term>
<term>Sequence Alignment</term>
<term>Software</term>
<term>Transcriptome</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr">
<term>Algorithmes</term>
<term>Alignement de séquences</term>
<term>Bases de données d'acides nucléiques</term>
<term>Expression des gènes</term>
<term>Gènes de synthèse</term>
<term>Logiciel</term>
<term>Séquençage nucléotidique à haut débit</term>
<term>Transcriptome</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>
<b>BACKGROUND</b>
</p>
<p>Deep RNA sequencing, the application of Next Generation sequencing technology to generate a comprehensive profile of the messenger RNA present in a set of biological samples, provides unprecedented resolution into the molecular foundations of biological processes. By aligning short-read RNA sequence data to a set of gene models, expression patterns for all of the genes and gene variants in a biological sample can be calculated. However, accurate determination of gene model expression from deep RNA sequencing is hindered by the presence of ambiguously aligning short-read sequences.</p>
</div>
<div type="abstract" xml:lang="en">
<p>
<b>FINDINGS</b>
</p>
<p>BowStrap, a program that applies the sequence alignment tool 'Bowtie' in a bootstrap-style approach, accommodates multiply-aligning short-read sequences and reports gene model expression as the average number of aligned reads per kb of gene model sequence per million aligned deep RNA sequence reads, together with a confidence interval suitable for calculating the statistical significance of the presence or absence of detected gene model expression. BowStrap v1.0 was validated against a simulated metatranscriptome. Results were compared with two alternative 'Bowtie'-based calculations of gene model expression. BowStrap identifies expressed gene models in a dataset more accurately and provides a more accurate estimate of gene model expression level than methods that do not incorporate a bootstrap-style approach.</p>
</div>
<div type="abstract" xml:lang="en">
<p>
<b>CONCLUSIONS</b>
</p>
<p>BowStrap v1.0 is superior to other alignment-based methods of determining patterns of gene expression, both in detecting significant gene model expression and in accurately determining gene model expression levels. BowStrap v1.0 can also use multiple processors and has a decreased run time compared to the previous version, BowStrap 0.5. We anticipate that BowStrap will be a highly useful addition to the available set of Next Generation RNA sequence analysis tools.</p>
</div>
</front>
</TEI>
<pubmed>
<MedlineCitation Status="MEDLINE" Owner="NLM">
<PMID Version="1">22676709</PMID>
<DateCompleted>
<Year>2013</Year>
<Month>02</Month>
<Day>12</Day>
</DateCompleted>
<DateRevised>
<Year>2018</Year>
<Month>11</Month>
<Day>13</Day>
</DateRevised>
<Article PubModel="Electronic">
<Journal>
<ISSN IssnType="Electronic">1756-0500</ISSN>
<JournalIssue CitedMedium="Internet">
<Volume>5</Volume>
<PubDate>
<Year>2012</Year>
<Month>Jun</Month>
<Day>07</Day>
</PubDate>
</JournalIssue>
<Title>BMC research notes</Title>
<ISOAbbreviation>BMC Res Notes</ISOAbbreviation>
</Journal>
<ArticleTitle>BowStrap v1.0: Assigning statistical significance to expressed genes using short-read transcriptome data.</ArticleTitle>
<Pagination>
<MedlinePgn>275</MedlinePgn>
</Pagination>
<ELocationID EIdType="doi" ValidYN="Y">10.1186/1756-0500-5-275</ELocationID>
<Abstract>
<AbstractText Label="BACKGROUND" NlmCategory="BACKGROUND">Deep RNA sequencing, the application of Next Generation sequencing technology to generate a comprehensive profile of the messenger RNA present in a set of biological samples, provides unprecedented resolution into the molecular foundations of biological processes. By aligning short-read RNA sequence data to a set of gene models, expression patterns for all of the genes and gene variants in a biological sample can be calculated. However, accurate determination of gene model expression from deep RNA sequencing is hindered by the presence of ambiguously aligning short-read sequences.</AbstractText>
<AbstractText Label="FINDINGS" NlmCategory="RESULTS">BowStrap, a program that applies the sequence alignment tool 'Bowtie' in a bootstrap-style approach, accommodates multiply-aligning short-read sequences and reports gene model expression as the average number of aligned reads per kb of gene model sequence per million aligned deep RNA sequence reads, together with a confidence interval suitable for calculating the statistical significance of the presence or absence of detected gene model expression. BowStrap v1.0 was validated against a simulated metatranscriptome. Results were compared with two alternative 'Bowtie'-based calculations of gene model expression. BowStrap identifies expressed gene models in a dataset more accurately and provides a more accurate estimate of gene model expression level than methods that do not incorporate a bootstrap-style approach.</AbstractText>
<AbstractText Label="CONCLUSIONS" NlmCategory="CONCLUSIONS">BowStrap v1.0 is superior to other alignment-based methods of determining patterns of gene expression, both in detecting significant gene model expression and in accurately determining gene model expression levels. BowStrap v1.0 can also use multiple processors and has a decreased run time compared to the previous version, BowStrap 0.5. We anticipate that BowStrap will be a highly useful addition to the available set of Next Generation RNA sequence analysis tools.</AbstractText>
</Abstract>
<AuthorList CompleteYN="Y">
<Author ValidYN="Y">
<LastName>Larsen</LastName>
<ForeName>Peter E</ForeName>
<Initials>PE</Initials>
<AffiliationInfo>
<Affiliation>Biosciences Division, Argonne National Laboratory, Lemont, IL, 60490, USA. plarsen@anl.gov</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Collart</LastName>
<ForeName>Frank R</ForeName>
<Initials>FR</Initials>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList>
<PublicationType UI="D016428">Journal Article</PublicationType>
<PublicationType UI="D013486">Research Support, U.S. Gov't, Non-P.H.S.</PublicationType>
</PublicationTypeList>
<ArticleDate DateType="Electronic">
<Year>2012</Year>
<Month>06</Month>
<Day>07</Day>
</ArticleDate>
</Article>
<MedlineJournalInfo>
<Country>England</Country>
<MedlineTA>BMC Res Notes</MedlineTA>
<NlmUniqueID>101462768</NlmUniqueID>
<ISSNLinking>1756-0500</ISSNLinking>
</MedlineJournalInfo>
<CitationSubset>IM</CitationSubset>
<MeshHeadingList>
<MeshHeading>
<DescriptorName UI="D000465" MajorTopicYN="N">Algorithms</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D030561" MajorTopicYN="N">Databases, Nucleic Acid</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D015870" MajorTopicYN="Y">Gene Expression</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D020869" MajorTopicYN="N">Gene Expression Profiling</DescriptorName>
<QualifierName UI="Q000379" MajorTopicYN="Y">methods</QualifierName>
<QualifierName UI="Q000706" MajorTopicYN="N">statistics &amp; numerical data</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D005813" MajorTopicYN="Y">Genes, Synthetic</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D059014" MajorTopicYN="N">High-Throughput Nucleotide Sequencing</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D055399" MajorTopicYN="N">Laccaria</DescriptorName>
<QualifierName UI="Q000235" MajorTopicYN="N">genetics</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D032107" MajorTopicYN="N">Populus</DescriptorName>
<QualifierName UI="Q000235" MajorTopicYN="N">genetics</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D016415" MajorTopicYN="N">Sequence Alignment</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D017423" MajorTopicYN="N">Sequence Analysis, RNA</DescriptorName>
<QualifierName UI="Q000379" MajorTopicYN="Y">methods</QualifierName>
<QualifierName UI="Q000706" MajorTopicYN="N">statistics &amp; numerical data</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D012984" MajorTopicYN="Y">Software</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D059467" MajorTopicYN="Y">Transcriptome</DescriptorName>
</MeshHeading>
</MeshHeadingList>
</MedlineCitation>
<PubmedData>
<History>
<PubMedPubDate PubStatus="received">
<Year>2012</Year>
<Month>02</Month>
<Day>17</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="accepted">
<Year>2012</Year>
<Month>05</Month>
<Day>25</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="entrez">
<Year>2012</Year>
<Month>6</Month>
<Day>9</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="pubmed">
<Year>2012</Year>
<Month>6</Month>
<Day>9</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline">
<Year>2013</Year>
<Month>2</Month>
<Day>13</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>epublish</PublicationStatus>
<ArticleIdList>
<ArticleId IdType="pubmed">22676709</ArticleId>
<ArticleId IdType="pii">1756-0500-5-275</ArticleId>
<ArticleId IdType="doi">10.1186/1756-0500-5-275</ArticleId>
<ArticleId IdType="pmc">PMC3494516</ArticleId>
</ArticleIdList>
<ReferenceList>
<Reference>
<Citation>Science. 2006 Sep 15;313(5793):1596-604</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">16973872</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Nat Methods. 2008 Feb;5(2):183-8</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">18204455</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Science. 2008 Jun 6;320(5881):1344-9</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">18451266</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Nat Methods. 2008 Jul;5(7):621-8</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">18516045</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Genome Res. 2008 Sep;18(9):1509-17</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">18550803</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Genome Biol. 2011;12(3):R22</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21410973</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2009 Sep 1;25(17):2194-9</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">19549630</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>PLoS One. 2010;5(7):e9780</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">20625404</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Curr Protoc Bioinformatics. 2010 Dec;Chapter 11:Unit 11.7</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21154709</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>BMC Syst Biol. 2011;5:70</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21569493</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>New Phytol. 2008;180(2):296-310</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">19138220</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
</PubmedData>
</pubmed>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
</list>
<tree>
<noCountry>
<name sortKey="Collart, Frank R" sort="Collart, Frank R" uniqKey="Collart F" first="Frank R" last="Collart">Frank R. Collart</name>
</noCountry>
<country name="États-Unis">
<noRegion>
<name sortKey="Larsen, Peter E" sort="Larsen, Peter E" uniqKey="Larsen P" first="Peter E" last="Larsen">Peter E. Larsen</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Bois/explor/PoplarV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002B78 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002B78 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Bois
   |area=    PoplarV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     pubmed:22676709
   |texte=   BowStrap v1.0: Assigning statistical significance to expressed genes using short-read transcriptome data.
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:22676709" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a PoplarV1 

Wicri

This area was generated with Dilib version V0.6.37.
Data generation: Wed Nov 18 12:07:19 2020. Site generation: Wed Nov 18 12:16:31 2020